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Abstract. IDENTIKIT was originally developed as a fast approximate scheme for 
modeling the tidal morphology and kinematics of disk galaxy encounters and mergers. 
In this form, it was first used to implement an interactive modeling tool for galaxy colli- 
sions; tests with artificial data showed that the morphology and kinematics of merging 
galaxies strongly constrain their initial conditions. This tool is now being applied to 
real galaxies. More recently, IDENTIKIT has been used to develop a mapping from the 
present state of a tidal encounter back to the initial conditions; this offers a way to partly 
automate the search for dynamical models of galaxy encounters. Finally, IDENTIKIT'S 
theoretical applications include a comprehensive way to evaluate the mass and extent 
of tidal features as functions of halo structure. 



1. Introduction 



The tidal theory of galaxy e ncounters e xplains the morphological and kinematic fea- 
tures of "peculiar" galaxies dArplll966h as consequences of gravita tional interactions 
between previously normal disk galaxies dToomre & Toomrelll972l hereafter TT72). 



While this theory has strong theoretical foundations, it gained considerable credibil- 
ity from TT72's plausible simulations of four well-known interacting galaxies. In the 
decades since TT72, the number of systems with detailed dynamical models has grad- 
ually increased. There are several reasons, beyond testing the tidal theory, to create 
models of specific systems: such models (1) help interpret complex three-dimensional 
morphology, (2) provide access to the time domain, and (3) may be used to test sub-grid 
models of star formation, feedback, AGN fueling, etc. Thus, a dynamical merger model 
can provide a framework unifying disparate observations into a coherent picture. 

Finding initial conditions which reproduce a specific pair of interacting galaxies 
is a time-consuming business. The parameter set is large, and parameters interact in 
complex ways. Hours may be required to run a single self-consistent calculation, and 
weeks of trial and error may be needed to obtain an acceptable match. 

Sixteen parameters specify an encounter between two galaxies; these fall into three 
distinct groups as follows. (1) The initial orbit is specified by the pericentric separation 
p, mass ratio \i, and eccentricity e. (2) Disk orientations are specified by inclination 
angles i and azimuthal angles cp. (3) The mapping from a simulation to observables is 
specified by the time since pericenter t, three viewing angles 6 a , length and velocity 
scale factors £, and < V, position zero-point ro and velocity zero-point vo- Together, 
groups (1) and (2) specify the initial conditions. 
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The criteria used to decide if a model matches the observations are subtle. As a 
rule, it's not enough to reproduce the tidal morphology of an interacting pair of ga laxies, 
since different encounter geometries may produce identical morphologies (e.g. iBarnesI 



2011). Kinematic information on tidal bridges and tails provides much stronger con- 



straints. Interferometric HI data, which traces both tidal morphology and kinemat- 
ics, is often used to constrain merger models; Ha may also be useful if tidal features 
contain emission-line regions. Molecular and/or stellar absorption lines can also pro- 
vide constraints, but mapping extended structures in these lines is expensive. Models 
and observations are typically compared visuall y, for example by overplo tting particles 
on various projections of a HI data cube (e.g. iHibbard & Mihoslll995h . Since inter- 
stellar material converts between molecular, atomic, and ionized phases by processes 
outside the scope of purely dynamical models, visual inspection may be more reli- 
able than quantitative measures derived by differencing model and observational data 
cubes. However, visual inspection is tedious and inherently subjective; robust quantita- 
tive methods would certainly be welcome. 

Some progress has been ma de using genetic algorithms to automate the job of 
modeling interacting galaxies (e.g. lWahde(ll998l ; lTheis & Kohldl200ll ; ISmith et al.l2010h . 
Interesting results have been obtained, but most tests to date have searched only limited 
subsets of the relevant parameters and made little use of kinematic information. 



2. IDENTIKIT 1 

IDEN TIKIT simulations combine test-particle and self-consistent techniques (Ba rnes & Hibbard 
2009h . Each galaxy is modeled by an initially spherical configuration of massive par- 
ticles with cumulative mass profile m(r), in which is embedded a spherical swarm of 
massless test particles on initially circular orbits. Two such models with mass ratio fi are 
launched towards each other on an orbit with eccentricity e and pericentric separation 
p. During the ensuing encounter, the massive components interact self-consistently, 
closely approximating the time-dependent potential and orbit decay of a fully self- 
consistent galactic collision. The test particles mimic the tidal response of embedded 
discs with all possible spin vectors; once such a simulation has been run, selecting the 
appropriate subset of test particles yields a good approximation to the tidal response of 
any particular disc. 

Using t his scheme, trial-and-error modeling of interacting galaxies becomes much 
less tedious. iBarnes & Hibbardl (120091) constructed an artificial data set of 36 parabolic 
(e = 1) encounters between equal-mass (jj. = 1) disk galaxies. The encounter ge- 
ometries, pericentric separations, times since pericenter, viewing directions, and scale 
factors were chosen at random. With no knowledge of the actual parameter values, 
IDENTIKIT 1 was used to search for models reproducing the observable morphology 
and kinematics of these 36 systems. Some 30 cases were successfully reconstructed. 
In these cases, all unknown parameters were well-constrained; for example, encounter 
geometry and viewing direction were recovered with median errors of < 15°. 

2.1. Modeling real galaxies 

Having passed "laboratory" tests, IDENTIKIT is ready to apply to real galaxies. This 
immediately raises several new issues. First is the limitations of the observational data. 
For most systems, HI is the only available tracer of large-scale morphology and kine- 
matics. In many cases, the resolution and signal-to-noise of the data barely suffice to 



Experiments With IDENTIKIT 



3 




Figure 1. NGC 2623. Top-left and bottom-right panels show HI and optical im- 
ages, respectively; North is up and West is right. Top-right and bottom-left panels 
show HI position-velocity diagrams; velocity increases to left and down, respec- 
tively. Grey-scale shows emission, while contours show HI absorption. Points show 
a preliminary IDENTIKIT 1 model (Privon et al., in preparation). 



trace tidal structures. Second, the observational data may display features outside the 
scope of simple test-particle models. For example, HI data often includes absorption 
features, and it's not clear if or how such features can be matched by the simulations. 
Third, the mass models used in the IDENTIKIT simulations may not be good approx- 
imations to the actual mass distributions of the galaxies involved in an encounter. At 
this point, the simulations use generic disk galaxy models; tailoring these models to 
specific systems is an important challenge. 

Fig- CD presents a preliminary model of NGC 2623 (Privon et al., in preparation) 
whic h illustrates all three o f these issues. First, while HI is definitely detected in the 
tails ( Hibbard & Yun| [l996), its emission is easier to map by taking the maximum voxel 
value along each line of sight through the data cube; summing the emission tends to 
produce noisy results. It's not entirely clear how to compare the present maps to the the 
projected particle distribution, which most naturally corresponds to summed emission. 
In addition, HI emission is not detected from the bright star-forming region to the south 
of the main body of NGC 2623; optical spectroscopy may help to determine kinematics 
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in this region. Second, the system exhibits HI absorption, presum ably due to neutral 
hydrogen silhouetted against NGC 2623 's AGN (lEvans et al.ll2008h . It's interesting that 
the velocity width of this absorption feature is fairly well reproduced by the width of the 
particle distribution, but not entirely obvious that this is a success of the model since 
HI on the far side of the nucleus does not contribute to the absorption. Third, while 
NGC 2623 probably results from a merger of comparable galaxies, the hook-shaped 
tail to the West might be better reproduced if its parent galaxy's disk was initially larger 
than its partner's. 

Further experience modeling a number of different systems is needed to explore 
these issues. It would also be interesting to combine HI with optical data, and to in- 
clude other velocity tracers such as Ha or CO. We are currently modeling Arp 240 
(NGC 5257/8); other systems on our short-list include NGC 34 and NGC 3256. Eventu- 
ally we h ope to model a num ber of galaxies in the GOALS sample of luminous infrared 
galaxies (lArmus et al.ll2009T) . 



3. IDENTIKIT 2 



IDENTIKIT 2 (Barnes 20111) uses the self-consistent plus test-particle simulations de- 
scribed above to solve for the initial orientation of each disk. This offers a significant 
shortcut compared to the trial and error technique used with IDENTIKIT 1. The main 
assumption required is that the tidal features associated with each galaxy can be traced 
back to a single disk with a unique orientation. 

To see how IDENTIKIT 2 works, consider a single region of phase space with fi- 
nite extent in X, Y, and Vz and infinite extent in the remaining dimensions. This region, 
hereafter called a "box", is placed so as to sample the tidal material from a particular 
galaxy. At some time post-encounter, a single pass through the test-particle array for 
that galaxy selects all particles falling within the box. Each particle has been labeled 
with its initial spin axis relative to its parent galaxy, so the selected particles define a 
density distribution on the unit sphere of all possible spin directions. As a rule, this 
distribution is extended and does not define a unique orientation for the parent disk. 
However, another box sampling tidal material from the same disk will generate a dif- 
ferent distribution, and the two distributions must overlap at the disk's true orientation. 

Given boxes tracing tidal features from both galaxies, IDENTIKIT 2 can solve 
for the inclinations and azimuths of both disks, directly determining four of the sixteen 
encounter and viewing parameters. The remaining twelve, however, must be specified 
beforehand, and if the specified values are wrong then the derived disk orientations will 
probably be wrong as well. But if at least three boxes - ideally more - are used to 
sample each disk, the effects of parameter mismatch tend to destroy the mutual overlap 
of the distributions. By quantifying how well the distributions overlap - for example, 
by forming the product of the densities they trace - a relative figure of merit for different 
solutions is obtained. 

With a fairly robust method of measuring quality of fit, it's possible to implement 
automa t ic sear ching over some subset of the twelve parameters besides disk orientation. 
Barnesl ( 201 lh described an implementation designed for systems composed of two 



disk galaxies which have not yet merged. In this version, just six parameters - the 
initial orbit (p, fi, e), time since pericenter t, velocity scale < V, and zero-point vo - must 
be specified ahead of time. In addition to a set of boxes tracing the tidal features of 
each galaxy, the algorithm requires coordinates on the plane of the sky for both galaxy 
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Figure 2. NGC 4676. Top-left and bottom-right panels show HI and optical im- 
ages, respectively; North is up and West is right. Top-right and bottom-left panels 
show HI position-velocity diagrams; velocity increases to left and down, respec- 
tively. Boxes show constraints used to determine disk orientation. Points show 
IDENTIKIT 2 model. 



nuclei. It performs a blind search over all possible viewing directions (8x, 9y)\ for 
each direction, the length scale X, rotation about the line of sight Oz, and position zero- 
point ro are determined by requiring the centers of the model galaxies to match the 
observed positions. Fits to both disks are scored independently as described above; the 
viewing direction which maximizes the product of both scores is selected. In tests with 
a small ensemble of simulated random mergers, the algorithm reconstructed encounter 
geometry and viewing direction with median errors of < 8°. 

Fig.|2]shows that the present algorithm already has interesting real-world applica- 
tions. Here, four regions are allocated to each tail of "The Mice", NG C 4676; these re- 



gions track the HI morphology and line-of-sight kinematics of the tails (Hibb ard & van Gorkom 



1996). In addition, the solution was constrained by requiring the simulated nuclei 



(crosses) to match the observed positions, and fall within the observed range of sys- 
temic velocities. Six parameters were specified a priori: the orbital eccentricity (e = 1), 
mass ratio (ji = 1), pericentric separation (~ 4.5 disk scale lengths), time since peri- 
center (~ 1.25 disk rotation periods), veloc ity scale and velocity zero-point are close to 
the values adopted in earlier models (e.g. iBarnes 120041) . However, the remaining ten 
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Figure 3. Fourteen-parameter search for models of NGC 4676. From one panel 
to the next, time since pericenter t increases left to right, and pericentric separation 
p increases bottom to top. Each panel shows a 9 x 8 grid of models in *V and vo. 
Symbol size indicates model score. The 122 highest-scoring solutions are visually 
classified as poor (triangles: N = 30), fair (squares: N = 56), and good (circles: 
N = 36); low-scoring solutions are shown as grey crosses. 



parameters were all derived by the algorithm, which required about two minutes on a 
2.16 GHz processor to examine 5120 viewing directions and produce the model shown 
here. This model is about as good as models of NGC 4676 derived by hand. Other 
"well-separated" systems which can be modeled in the same way include Arp 256, 
Arp 295, and Arp 298; HI data is available for all three. 

3.1. Uniqueness and uncertainty 

IDENTIKIT 2 is fast and robust enough to go beyond the ten-parameter search de- 
scribed above. For a system like NGC 4676, it's reasonable to fix the mass ratio fx — 1, 
adopt an e = 1 (parabolic) initial orbit, and perform a blind search in pericentric sep- 
aration p, time since pericenter t, velocity scale *V, and velocity zero-point vo, as well 
as the angles Ox and 9y defining direction of view. As the number of parameters to be 
determined increases, the algorithm yields solutions with comparable overall scores for 
many parameter combinations. In Fig. [3] solutions have been visually graded as poor, 
fair, or good representations of NGC 4676. It's clear that numerical score is an imper- 
fect indicator of quality, since the highest-scoring solutions (in the t = 0.75, p = 0.25 
panel) are classified as fair, but many good solutions do get high scores, while poor 
solutions consistently get low scores. 

The high-scoring solutions in Fig. [3] are not scattered at random; most fall along 
a diagonal from lower left to upper right across the panels, indicating a general corre- 
lation between time since pericenter t and pericentric separation p. Other parameters 
which correlate with t include the velocity zero-point vq, viewing angles (%, 6y), and 
length and velocity scale factors X and *V. These correlations indicate that acceptable 
solutions populate an "error ellipse" in parameter space. As a group, these solutions 
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are relatively homogeneous; they all appear to originate from a single connected region 
of parameter space. Moreover, the physical time since pericenter, T = (JLI'V)t, is very 
well constrained; all of the good solutions and almost all of the fair ones yield T values 
between 150 and 200 Myr. 

Although it may seem better to obtain a unique model, the ensemble of solutions 
for NGC 4676 shown in Fig.[3]is probably a good representation of the actual uncertain- 
ties inherent in modeling this system with available HI data. There's no telling if most 
interacting galaxies will likewise yield fairly well-constrained solutions; NGC 4676 is 
a relatively simple system, and others may be harder to constrain. 

This 14-parameter search shows that comprehensive surveys of the encounter pa- 
rameter space are possible. While the internal structures of the victim galaxies remain 
to be parametrized (§ 4), the ability to search such large spaces is encouraging. 

4. Mass models 

The results presented above use a generic galaxy model with a bulge+disk:halo mass 
ratio (mb + m&) : - 1 : 4. While this model is adequate for some purposes, the 
halo comprises just 80% of the total, which is less than expected in CDM cosmologies. 
Moreover, any effort at data-driven modeling of real mergers must address variations in 
initial galaxy structure. How sensitive are model results to inevitable discrepancies be- 
tween the adopted and actual structure of the progenitor galaxies? Can tidal encounter 
models accurately probe the overall depth and structure of halo potentia l wells? 

Existing work on the effects o f halo structure on tidal features (Dubinsk i et al. 
19961 . 1 1 9991 : ISpringel & White! 1 19991) has largely focused on direct, co-planar encoun- 
ters which maximize tidal features; a systematic study varying encounter geometry 
as well as galaxy structure is bedeviled by a large number of parameters. However, 
IDENTIKIT offers an efficient way to treat encounter geometry, since a single simula- 
tion simultaneously models all possible disk orientations. Fig. @] presents an example, 
based on a galaxy model with mass ratio (m\, + m^) : = 1 : 9. The plot shows test 
particles which have attained a maximum distance from their parent galaxy of at least 
three times their initial orbital radius; these particles populate tidal features. Particles 
are classified as belonging to bridge or tail structures based on their position relative to 
the two galaxies at (1) first passage and (2) instant of maximum distance; a very few 
particles don't belong to either. No selection for initial disk orientation is done, so the 
result is a "synoptic" view of all tidal structures resulting from this encounter. 

The histogram on the right of Fig. |4] shows how the mass fractions /und in bridges 
and tails depend on disk inclination i. At small inclinations (i < 30°), the bridge is 
relatively massive, comprising ~ 8% of the disk material. However, bridge mass drops 
rapidly with increasing i. The tail is less massive, amounting to ~ 2.5% of the disk 
material for small i, but drops off less rapidly with increasing i. In effect, bridge-to-tail 
mass ratio is a decreasing function of inclination; this trend would be very difficult to 
quantify using conventional simulations. 
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showing bridge and tail mass fractions as functions of disk inclination i. 
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